
Arm backend: Add experimental support for new TOSAQuantizer #18100

Open

AdrianLundell wants to merge 6 commits into pytorch:main from AdrianLundell:change-1183485

Conversation

AdrianLundell (Collaborator) commented Mar 11, 2026

Allows initializing the TOSA/EthosU/Vgf quantizers with use_composable_quantizer=True to use a new implementation of the quantizer following the Cortex-M quantizer. See #17701 for more details.

  • Creates a new temporary TOSAQuantizer API layer for switching between the two versions.
  • Adds a TOSAQuantizationConfig capturing TOSA-specific qspec requirements for certain ops.
  • Adds quantizer_support.py for defining which operators are supported by the quantizer.
  • Aligns mark_node_as_annotated in the Cortex-M backend with TOSAQuantizer behaviour.
  • Updates the quantizer reporter to handle TOSA qspecs, as they are created dynamically.
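The "temporary API layer for switching between the two versions" described above can be illustrated with a minimal sketch. Note that all class names and the structure below are illustrative assumptions, not the actual ExecuTorch implementation; only the use_composable_quantizer flag name comes from the PR description.

```python
# Hypothetical sketch: a thin API layer that dispatches between a legacy
# quantizer and a new composable one based on a constructor flag.
# Class names are illustrative, not the real ExecuTorch classes.


class LegacyQuantizer:
    def quantize(self, model: str) -> str:
        # Stand-in for the existing quantizer implementation.
        return f"legacy({model})"


class ComposableQuantizer:
    def quantize(self, model: str) -> str:
        # Stand-in for the new Cortex-M-style composable implementation.
        return f"composable({model})"


class TOSAQuantizerFacade:
    """Temporary switching layer: picks one implementation at init time."""

    def __init__(self, use_composable_quantizer: bool = False) -> None:
        self._impl = (
            ComposableQuantizer() if use_composable_quantizer else LegacyQuantizer()
        )

    def quantize(self, model: str) -> str:
        return self._impl.quantize(model)
```

Keeping both implementations behind one facade lets the default be flipped later with a one-line change, which matches the author's stated plan of switching the flag to True once feedback is in.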

cc @digantdesai @freddan80 @per @zingo @oscarandersson8218 @mansnils @Sebastian-Larsson @robell


Signed-off-by: Adrian Lundell <adrian.lundell@arm.com>
Change-Id: Icbca66ff86e6f78ffa1c8dcec55e17c25f97d8ca
@AdrianLundell AdrianLundell added partner: arm For backend delegation, kernels, demo, etc. from the 3rd-party partner, Arm ciflow/trunk release notes: arm Changes to the ARM backend delegate labels Mar 11, 2026
pytorch-bot bot commented Mar 11, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18100

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 New Failures, 3 Unrelated Failures

As of commit 885a81b with merge base b836a57:

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 11, 2026
@zingo zingo added this to the 1.2.0 milestone Mar 11, 2026
Signed-off-by: Adrian Lundell <adrian.lundell@arm.com>
Change-Id: Id81e0c39d13a94a749206441fce60664c80a0af8
zingo (Collaborator) commented Mar 11, 2026

Hi @SS-JIA / @digantdesai this adds a file, do you want/need to check this?
This is also something we would like to get into 1.2 if possible.

AdrianLundell (Collaborator, Author):

The failures are unrelated.


# Lazily import heavy quantizer classes to avoid circular imports with
# Cortex-M quantization configs.
_LAZY_EXPORTS = {
Contributor:

_LAZY_IMPORTS?

AdrianLundell (Collaborator, Author):

This is a workaround, since the import situation is a bit messy with imports across the Cortex-M and Arm backends. The idea is to clean this up once things have stabilized; I didn't want to move things around in this commit, to keep the diff cleaner. I hope this is OK.

digantdesai (Contributor):

I like the overall direction and the Cortex-M quantizer reuse, as well as the prioritization for the composition.
I will let RJ review and stamp, as I am still catching up.

One thing I would say is: add an e2e test with both the Ethos and Cortex-M quantizers, and run the test on FVP.

AdrianLundell and others added 2 commits March 13, 2026 17:05
- Spelling errors
- Buck fixes
- E2E model test on FVP with the new quantizer

Signed-off-by: Adrian Lundell <adrian.lundell@arm.com>
Change-Id: I43dadbcec22b3b28c65a1c9708790881dd3868a2
AdrianLundell (Collaborator, Author):

I hope this is what you had in mind for the test, @digantdesai. We have also successfully run all tests internally with the new quantizer flag enabled. The idea is that you and others can try this out and give feedback, so there should be minimal disruption when we flip the flag to True as the default.

meta-codesync bot commented Mar 13, 2026

@rascani has imported this pull request. If you are a Meta employee, you can view this in D96368239.
